A corpus based approach to generalising a chatbot system

نویسنده

  • Bayan Abu Shawar
چکیده

International research in NLP is dominated by work on English. NLP techniques and systems can be ported to other natural languages, but this is generally a labour-intensive task, requiring scarce computational and linguistic expertise; hence minority languages are poorly represented in NLP technology. We present an automated approach to porting an NLP technology, the AIML-based chatbot, to new languages, by using a corpus in the target language to retrain the chatbot. We have successfully automated production of chatbots talking French, and Afrikaans; and are developing further demonstrators in Spanish and Arabic.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ALICE Chatbot: Trials and Outputs

A chatbot is a conversational agent that interacts with users using natural language. Multi chatbots are available to serve in different domains. However, the knowledge base of chatbots is hand coded in its brain. This paper presents an overview of ALICE chatbot, its AIML format, and our experiments to generate different prototypes of ALICE automatically based on a corpus approach. A descriptio...

متن کامل

Using the Corpus of Spoken Afrikaans to generate an Afrikaans chatbot

This paper presents two chatbot systems, ALICE and Elizabeth, illustrating the dialogue knowledge representation and pattern matching techniques of each. We discuss the problems which arise when using the Corpus of Spoken Afrikaans (Korpus Gesproke Afrikaans) to retrain the ALICE chatbot system with human dialogue examples. A Java program to convert from dialog transcripts to the AIML linguisti...

متن کامل

A Chatbot as a Novel Corpus Visualization Tool

The classical way of viewing data set is using the visualization process, which maps the data from numerical or textual form to a visual representation that our mind can easily interpret such as: using graphical diagrams, charts, and geometric representation. In this paper we introduce a new idea to visualize a dialogue corpus using a chatbot interface tool. We developed a java program to conve...

متن کامل

A data-driven model of explanations for a chatbot that helps to practice conversation in a foreign language

This article describes a model of otherinitiated self-repair for a chatbot that helps to practice conversation in a foreign language. The model was developed using a corpus of instant messaging conversations between German native and non-native speakers. Conversation Analysis helped to create computational models from a small number of examples. The model has been validated in an AIML-based cha...

متن کامل

Chatbots: Can They Serve as Natural Language Interfaces to Qa Corpus?

A chatbot is a program which can chat in natural language, on a topic built into the chatbot’s internal knowledge model. Many chatbots exist, with different knowledge-bases programmed by the chatbot builders. We have built a system to convert a website text (corpus) to a chatbot knowledge-base format. In this paper the chatbot is used as a question answer interface, where TRE09 QA track is used...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Procesamiento del Lenguaje Natural

دوره 31  شماره 

صفحات  -

تاریخ انتشار 2003